MalFSM: Feature Subset Selection Method for Malware Family Classification

نویسندگان

چکیده

Malware detection has been a hot spot in cyberspace security and academic research. We investigate the correlation between opcode features of malicious samples perform feature extraction, selection fusion by filtering redundant features, thus alleviating dimensional disaster problem achieving efficient identification malware families for proper classification. authors use obfuscation technology to generate large number variants, which imposes heavy analysis burden on researchers consumes lot resources both time space. To this end, we propose MalFSM framework. Through method, reduce 735 contained Kaggle dataset 16, then fuse metadata (count file lines size) total 18 find that machine learning classification is high accuracy. analyzed interpreted selected features. Our comprehensive experiments show highest accuracy can reach up 98.6% only 7.76 s Microsoft.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection for Malware Classification

In applying machine learning to malware identification, different types of features have proven to be successful. These features have also been tested with different kinds of classification methodologies and have had varying degrees of success. Every time a new machine learning methodology is introduced for classifying malware, there is the potential for increasing the overall quality of malwar...

متن کامل

Feature Selection and Extraction for Malware Classification

The explosive amount of malware continues their threats in network and operating systems. Signature-based method is widely used for detecting malware. Unfortunately, it is unable to determine variant malware on-the-fly. On the hand, behavior-based method can effectively characterize the behaviors of malware. However, it is time-consuming to train and predict for each specific family of malware....

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

Image Classification Using Feature Subset Selection

Classification technology is essential for fast retrieval in large database. This paper proposes a combining GA and SVM model to content-based image retrieval. The proposed method is also used to classification similar images from database. Joint HSV histogram and average entropy computed from gray-level co-occurrence matrices in the localized image region is employed as input vectors. Genetic ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Chinese Journal of Electronics

سال: 2023

ISSN: ['1022-4653', '2075-5597']

DOI: https://doi.org/10.23919/cje.2022.00.038